Special Issue on Advances in Data Mining and Robust Statistics

نویسندگان

  • Michael W. Berry
  • Jung Jin Lee
  • Giovanni Montana
  • Stefan Van Aelst
  • Ruben H. Zamar
چکیده

The main aim of data mining is to extract knowledge from, usually very large, datasets. Data mining techniques are often applied to gain initial insights about the data and complement statistical models. This special issue focuses on the interface between data mining and statistical modelling, with special emphasis on robust statistics. Very large datasets, especially those that are machine generated and undergo limited quality control, are likely to contain outliers and anomalous measurements. The analysis of such datasets requires statistical approaches that are both computationally efficient and robust against outliers and mild departures from model assumptions. The broad scope includes, but is not limited to, visualization techniques for very large and complex data, including relational data, data analysis algorithms including optimisation and search techniques, methodologies to draw inference on patterns and subgroups, robust models, outlier detection methods, and the analysis of dependencies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Special Issue on Data Mining and Pattern Analysis in Computational Bioscience

This special issue provides a collection of papers that report recent advances in computational bioscience with a focus on biological pattern discovery and data mining. The special issue begins with a meeting report, followed by five articles. In “The DNA–Proteome: Recent advances towards establishing the protein–DNA interaction space,” Erich Grotewold and Herbert Auer summarize findings from t...

متن کامل

The Use of Robust Factor Analysis of Compositional Geochemical Data for the Recognition of the Target Area in Khusf 1:100000 Sheet, South Khorasan, Iran

The closed nature of geochemical data has been proven in many studies. Compositional data have special properties that mean that standard statistical methods cannot be used to analyse them. These data imply a particular geometry called Aitchison geometry in the simplex space. For analysis, the dataset must first be opened by the various transformations provided. One of the most popular of the a...

متن کامل

A robust least squares fuzzy regression model based on kernel function

In this paper, a new approach is presented to fit arobust fuzzy regression model based on some fuzzy quantities. Inthis approach, we first introduce a new distance between two fuzzynumbers using the kernel function, and then, based on the leastsquares method, the parameters of fuzzy regression model isestimated. The proposed approach has a suitable performance to<b...

متن کامل

Robust production scheduling in open-pit mining under uncertainty: a box counterpart approach

Open-Pit Production Scheduling (OPPS) problem focuses on determining a block sequencing and scheduling to maximize Net Present Value (NPV) of the venture under constraints. The scheduling model is critically sensitive to the economic value volatility of block, block weight, and operational capacity. In order to deal with the OPPS uncertainties, various approaches can be recommended. Robust opti...

متن کامل

Special issue on imprecision in statistical data analysis

Different sources of imprecision in connection with both empirical data or models may arise in statistical data analysis. Such an imprecision is frequently connected with linguistic data, expert opinions, perceptions as well as various kinds of ill-observed data or non-precisely defined concepts. Fuzzy sets, intervals, belief functions, random sets or imprecise probability models have been used...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computational Statistics & Data Analysis

دوره 93  شماره 

صفحات  -

تاریخ انتشار 2016